Approximation Algorithms for Multiple Sequence Alignment Under a Fixed Evolutionary Tree

نویسندگان

  • R. Ravi
  • John D. Kececioglu
چکیده

We consider the problem of multiple sequence alignment under a xed evolu tionary tree given a tree whose leaves are labeled by sequences nd ancestral sequences to label its internal nodes so as to minimize the total length of the tree where the length of an edge is the edit distance between the sequences labeling its endpoints We present a new polynomial time approximation algorithm for this problem and analyze its performance on regular d ary trees with d a constant On such a tree the algorithm nds a solution within a factor d d of the minimum in O kT d n k n time where k is the number of leaves in the tree n is the length of the longest sequence labeling a leaf and T d n is the time to compute a Steiner point for d sequences of length at most n A Steiner point for a set S of sequences is a sequence P that minimizes the sum of the edit distances from P to each sequence in S The time T d n is O d n given O ds time preprocessing for an alphabet of size s The approximation algorithm is conceptually simple and easy to implement and actually applies to any metric space in which a Steiner point for any xed sized set can be computed in polynomial time We also introduce a new problem bottleneck tree alignment in which the objective is to label the internal nodes of the tree so as to minimize the length of the longest edge We describe an exponential time exact algorithm for the case of unit cost edit operations and show there is a simple linear time approximation algorithm for the general case that nds a solution within a factor O log k of the minimum

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Application of Evolutionary Algorithms for Multiple Sequence Alignment

Multiple Sequence Alignment is a crucial task in Bioinformatics. Most of the commonly used multiple alignment methods are based on a dynamic programming approach. This approach however requires time proportional to the product of the sequence lengths and also doesn’t provide an extensible platform for evaluating different objective functions. Tree-based algorithms, which combine results from pa...

متن کامل

An Application of the ABS LX Algorithm to Multiple Sequence Alignment

We present an application of ABS algorithms for multiple sequence alignment (MSA). The Markov decision process (MDP) based model leads to a linear programming problem (LPP), whose solution is linked to a suggested alignment. The important features of our work include the facility of alignment of multiple sequences simultaneously and no limit for the length of the sequences. Our goal here is to ...

متن کامل

Improved Approximation Algorithms for Tree Alignment

Multiple sequence alignment is a task at the heart of much of current computaw x tional biology 4 . Several different objective functions have been proposed to formalize the task of multiple sequence alignment, but efficient algorithms are lacking in each case. Thus multiple sequence alignment is one of the most critical, essentially unsolved problems in computational biology. In this paper we ...

متن کامل

gpALIGNER: A Fast Algorithm for Global Pairwise Alignment of DNA Sequences

Bioinformatics, through the sequencing of the full genomes for many species, is increasingly relying on efficient global alignment tools exhibiting both high sensitivity and specificity. Many computational algorithms have been applied for solving the sequence alignment problem. Dynamic programming, statistical methods, approximation and heuristic algorithms are the most common methods appli...

متن کامل

Fixed Topology Alignment with Recombination

In this paper, we study a new version of multiple sequence alignment, xed topology alignment with recombination. We show that it can not be approximated within any constant ratio unless P = NP. For a more restricted version, we show that the problem is MAX-SNP-hard. This implies that there is no PTAS for this version unless P = NP. We also propose approximation algorithms for a special case, wh...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Discrete Applied Mathematics

دوره 88  شماره 

صفحات  -

تاریخ انتشار 1995